Collection Ranking and Selection for Federated Entity Search
نویسندگان
چکیده
Entity search has emerged as an important research topic over the past years, but so far has only been addressed in a centralized setting. In this paper we present an attempt to solve the task of ad-hoc entity retrieval in a cooperative distributed environment. We propose a new collection ranking and selection method for entity search, called AENN. The key underlying idea is that a lean, name-based representation of entities can efficiently be stored at the central broker, which, therefore, does not have to rely on sampling. This representation can then be utilized for collection ranking and selection in a way that the number of collections selected and the number of results requested from each collection is dynamically adjusted on a per-query basis. Using a collection of structured datasets in RDF and a sample of real web search queries targeting entities, we demonstrate that our approach outperforms state-of-the-art distributed document retrieval methods in terms of both effectiveness and efficiency.
منابع مشابه
NTNUiS at the TREC 2014 Federated Web Search Track
This paper describes our participation in the Federated Web Search track at TREC 2014. For the resource selection task we employ a learning-to-rank approach to combine various (instantiations of) resource ranking models. For the vertical selection task we treat the estimated collection relevance scores as binary judgements.
متن کاملOpinions in Federated Search: University of Lugano at TREC 2014 Federated Web Search Track
This technical report presents the work carried out at the University of Lugano on TREC 2014 Federated Web Search track. The main motivation behind our approach is to provide better coverage of opinions that are present in federated resources. On the resource selection and vertical selection steps, we apply opinion mining to select opinionated resources/verticals given a user’s query. We do thi...
متن کاملOverview of the TREC 2014 Federated Web Search Track (DRAFT)
The TREC Federated Web Search track facilitates research in topics related to federated web search, by providing a large realistic data collection sampled from a multitude of online search engines. The FedWeb 2013 challenges of Resource Selection and Results Merging challenges are again included in FedWeb 2014, and we additionally introduced the task of vertical selection. Other new aspects are...
متن کاملFederated Entity Search Using On-the-Fly Consolidation
Nowadays, search on the Web goes beyond the retrieval of textual Web sites and increasingly takes advantage of the growing amount of structured data. Of particular interest is entity search, where the units of retrieval are structured entities instead of textual documents. These entities reside in different sources, which may provide only limited information about their content and are therefor...
متن کاملLearning to Combine Collection-centric and Document-centric Models for Resource Selection
This paper describes our participation in the Federated Web Search track at TREC 2014. Our main focus is on the resource selection task, where we employ a learning-to-rank approach to combine various (instantiations of) resource ranking models. Further, we show that vertical selection can be run on the output from resource selection, and that it directly benefits from the improvements of thereof.
متن کامل